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Connectivity networks have recently become widely used in biology due to in- 
creasing amounts of information on the physical and functional links between in- 
dividual proteins. This connectivity data provides valuable material for expanding 
' our knowledge far beyond the experimentally validated via mathematical analysis 

^ ' and theoretical predictions of new functional interactions. In this paper we demon- 

D ' strate an application of several algorithms developed for the ranking of potential 

gene-expression regulators within the context of an associated network. We analyze 
how different types of connectivity between genes and proteins affect the topology 
of the integral C.elegans functional network and thereby validate algorithmic per- 
' formance. We demonstrate the possible definition of co-expression gene clusters 

■ within a network context from their specific motif distribution signatures. We also 
O ' show that the method based on the shortest path function (SPF) applied to gene 

• interactions sub-network of the co-expression gene cluster, efficiently predicts novel 

Q^' regulatory transcription factors (TFs). Simultaneous application of other methods, 
including only interactions with neighborhood genes, allows rapid ranking of poten- 

^ tial regulators that could be functionally linked with the group of co-expressed genes. 

0> ■ Predicting functions of regulators for a cluster of ribosomal/mRNA metabolic genes 

■ we highlight a role of mRNA translation and decay in a longevity of organisms. 

m 

O 
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I. INTRODUCTION 



Analysis of functional properties of biological systems requires the integration of both the 
^ ■ functions of their individual components and the properties of the components interactions. 
. Reconstruction of functional networks from the known pair-wise connectivity of biological 
molecules offers systems level insights into complex biologic phenomena. Though widely used 
and taken for granted the network-based approach requires very accurate choice of data to 
reflect a proper balance between maximizing connectivity on one hand, and preserving the 
reliability of network behavior on the other. 

Direct genetic and regulatory interactions as well as indirect indicators of functional 
links between proteins, such as co-expression, co-occurrence, gene-fusion, etc, are used to 
provide a generic topology of a network. It is known that subnetworks based on genetic 
and physical protein-protein interactions overlap only weakly [H, which perhaps indicates 
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different levels of functional organization within complex biological systems. However, it has 
been also documented that indirect indications or proxies of functional relevance between 
proteins, such as gene co-expression and genome co-localization are largely complementary 
and correlate well with ontology-based protein groupings j^l, 0] . It is not clear to what extent 
connectivities of different types affect the general architecture of an integrated network 
structure. 

The gaps in experimentally-derived knowledge on regulatory and structural features of 
biological systems can be filled to some extent by theoretical predictions. However, a greater 
understanding of principles of connectivity between biological functions at different levels of 
cellular organization is needed to avoid false extrapolations in proposed network structures 
or topologies. There is thus an immediate need for a means of evaluating the generally 
used organism-specific functional networks and understanding of their underlying properties. 
Here we show how the statistical analysis of network connectivity can be used to predict new 
gene expression regulators. We suggest that co-expression clusters can be easily identified 
as highly-connected islands within the integrated network, and that these islands can be 
used to suggest new regulatory genes for subsequent verification. For this, we propose a new 
application of modified statistical algorithm based on so-called "shortest path function" 
(SPF) to rank the nodes that have most effect on genes expression. This can be also applied 
to any explicitly defined group of genes. 

We retrieved gene co-expression clusters from existing large-scale microarray data on 
heat shock responses in C.elegans jsf and projected them onto subnetworks composed 
from a combination of different connectivity types (gene interactions, protein interactions, 
gene co-expression) in WormNet database of pair-wise undirected functional links between 
proteins jsj. Correlation between experimentally observed co-expression links and connectiv- 
ity of genes in Wormbase 0] is shown to be very strong especially for ubiquitously expressed 
ribosomal, proteosomal and exosomal gene clusters. Via application of SPF method we also 
predicted several transcription factors (TFs) as potential regulators of the analyzed groups 
of co-expressed genes. By searching for the most connected regulatory nodes within the 
neighborhood of each cluster we also noticed a strong link between nonsense mediated de- 
cay (NMD) as well as longevity-related genes and the highly connected clusters mentioned 
above. We suggest that these functional connections may explain a dependence of adult life 
span on the metabolism of polyamines. Interpretation of the organism-specific integral bio- 
logica networks and prediction of protein complexes and genetic regulators from a network 
context may benefit greatly from our study and the new algorithms. 



Microarray data are adopted from [8J where two parental C.elegans strains N2 (Bris- 
tol) and CB4856 (Hawaii) and the recombinant inbred strains were used to measure gene 



II. METHODS 



A. Data preparation 
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expression in a control conditions and after a heat shock. N2 and CB4856, represent two 
genetic and ecological extremes of C. elegans 9l-|l l| . Their genetic distance amounts to 



In 13 



about one polymorphism per 873 base pairs [l^. Both strains have contrasting behavioral 
phenotypes (solitary vers us g regarious) ll| and differ strikingly in their response to a tem- 
perature change 



the strains were exposed to 16 °C and 24 °C, temperatures 
that are known to strongly affect phenotypic characteristics such as body size, lifespan, and 
reproduction 

Gene expression patterns were assessed by oligonucleotide microarray hybridization. All 
microarray data have been deposited by the authors in NCBI's Gene Expression Omnibus 
(GEO, [l5|) and are accessible through the GEO Series accession number hsted under the 
Accession Numbers GSE5395. Absolute expression values have been used for KMC cluster- 
ing analysis by means of Mev4 application 16|] with requested 50 clusters/Euclidian distance. 
The resultant clusters were used as "co-expression" clusters in our analysis. 

String software 17|, ll8|] has been used to reconstruct graphical networks from the sets of 
C. elegans genes. WormBase database [tI has been used for ID retrieval and translation of IDs 
into gene names, associated functional annotations and ontological categories. WormNet 0] 
has been used a source of information on pair-wise interactions between genes. We also used 
it to retrieve data on separated genetic interactions of C. elegans genes and co-expression 
links in C. elegans. 



B. Methods of statistical analysis of network connectivity 

To investigate the connectivity properties of the gene clusters on WormNet and to estab- 
lish potential regulators for the gene clusters we use the following approach. Denote by di^M 
the shortest path along the network from a given vertex (regulatory gene) M to some other 
vertex (cluster gene) i. Consider the shortest path function (SPF) determined as follows 



N 

V.SPF 

N ^ d,M 



1 ^ 1 



where the summation is performed over all cluster genes (A^ is the number of the genes in 
the cluster, i.e. the cluster size). The connectivity of the gene cluster on the whole network 
and/or on its subnetworks we describe by SPF defined for all cluster genes 



N 

USPF _ ^ 



cf - (2) 



here A^ is the number of the genes in the cluster, dij is the shortest path between the nodes 
i and j. Thus defined, the SPF has a very transparent meaning, since it gives the averaged 
reciprocal paths between pair of cluster genes. If i and j are not linked on the network, 
the contribution to SPF from this pair {i,j) equals to zero, while the maximal contribution 
is reached for directly linked pairs. So, SPF can be used to characterize quantitatively 



4 O.V. Valba, S.K. Nechaev, O. Vasieva 



the connectivity of a gene cluster in a given network. For comparison we use also another 
method for determination the cluster connectivity. The so-called "connectivity coefficient" 
is defined as a ratio between the number of inner links (which connect only cluster genes) 
to the number of all links which the cluster has in the network: 

fci = ^ (3) 

We consider WormNet and its parts as unweighted undirected network. Note that SPF 
analysis can be easily extended for weighted and directed networks as well. 

Another approach to characterize a connectivity (and topology) of a subnetwork (cluster) 
is to investigate its motifs distribution. A network is a set of vertices (or nodes) and con- 
nections between them (links or edges) without self-connections and multiple edges. The 
network is random if any link occurs independently on others with a certain probability. 
The network is directed if any link either has an orientation i — )■ j, or is bidirectional i ^ j. 
Otherwise the network is non-directed. The network is set by the adjacency matrix. The 
local topological properties of networks, both directed and non-directed, for given number 
of vertices and vertex degree distribution are characterized by the rates of connected sub- 
graphs. Since the number of such subgraphs grows combinatorially with their sizes, usually 
only small subgraphs are considered. In particular, in the works 19142 1| only subgraphs of 
size 3 and 4 have been analyzed for directed and undirected networks. 

The rates of subgraphs in a given network depend on the vertex degree distribution. This 
complicates the comparison of networks of different sizes and of different degree distributions 
by the rates of their subgraphs. In order to compensate these differences, the procedure of 



so-called network randomization was proposed in works [20|, UM- this procedure the 
network experiences multiple permutations of links under the condition of conservation in 
each vertex the number of incoming, outgoing and bidirectional links. Using this method 
an ensemble of randomized versions of a given network is generated, and for every subgraph 
the statistical significance 

Zk = ; k = [l...m\ (4) 

is calculated, where is the amount of k-th subgraphs in the initial network; and (Nk) and 
(Tfc correspondingly the mean and the standard deviation of subgraphs of given type in the 
randomized networks; m is the total number of considered subgraphs. Subgraphs with the 



statistical significance essentially exceeding 1 are called motifs [19H21|. The distribution of 
motifs in the network under consideration is characterized by the significance profile which 
is a normalized vector x = {xi, ....,Xm} of statistical significance of all subgraphs of given 
size. The components of the vector x are 

Xk = I ; k = [l...m] (5) 
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C. Prediction of potential regulators for the gene clusters 

We use the SPF coefficients kff^ , determined in ([1]) for a search for potential regulators 
of co-expressed gene clusters. Note, that potential regulators can belong to the cluster or 
lie outside of it. For the search of the potential cluster regulators it is important that the 
regulator is specific to it. It has many connections to the gene cluster but at the same 
time is weakly connected to other genes in the network, which the cluster belongs to. In 
other words, the potential regulator must have enough links to control the cluster not being 
herewith the hub of the network. We introduce the value which takes into account the 
connectivity of the regulatory gene in the whole network: 

SPF ^=l ^^'^ (p. 

^ di,M 
1=1 ' 

By definition, the Kff^ is the ratio of the sum of the shortest paths from regulator to the 
cluster genes, and the sum of the shortest path from regulator to all genes in the network, 
one sees, that Kff^ measures how specific is the regulator to the cluster. The function ([T]) 
can be considered as a ffist term in expansion of the function ([6]). We use this value for 
the search of potential regulators to the gene clusters in the network and its subparts. We 
compare this method with some other described below. 

Namely, for any potential regulator we define the fraction of the cluster genes, which are 
connected to it: 

/« = ^ m 

Analogously to (E]) we introduce the value, which takes into account the "interaction" of the 
potential regulator with outer part of the cluster genes: 

Here, Nnj{G) is the number of all links, which the potential regulator M has in the whole 
network G. 



III. RESULTS 



A. Statistical properties of co-expression clusters 



Our first goal is the investigation of the properties of extracted gene clusters in the whole 
network WormNet. The Fig. [T] presents the dependencies of the cluster's connectivity and 
SPF on the cluster's size (the number of genes included in the cluster), as well as the motif's 
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distribution for some clusters. Connectivity coefficient is generally proportional to a cluster 
size. In average, an expression cluster is composed by genes connected in WormNet stronger 
than any group of randomly selected genes - see Fig. [T]A.. The SPF values for all gene clusters 
lie much higher than the SPF coefficients for random clusters shown in the Fig. [TJ3 (here and 
below: a random cluster is a cluster with randomly chosen nodes in a WormNet). The most 
connected clusters are bounded mainly by co-expression links in the WormBase Fig. [Tp. We 
have selected the clusters with very high connectivity coefficients. Such clusters are well- 
connected and are depicted by red points in the graphs. These clusters also are characterized 
by different motif's distribution with a prevail of linear 4-nodes chains Fig. [Tp. According 
to the motif's distribution of gene clusters, we can divide them into two groups. The ffist 
group includes the gene clusters, in which the fully connected subgraphs dominate and all 
well-connected clusters belong to this group. The second group is formed by the clusters 
with a small number of fully connected motifs. It is worth noting, that protein structure 



networks are also characterized by the same motif distribution 20 



The number of co-expression and physical connections strongly prevail over genetic in- 
teractions in the WormBase, which emphasizes the necessity in additional experimental 
analysis, or in-situ prediction of potential regulatory modules for C.elegans. Our analysis 
shows that the motif's profiles for expression sub-network of gene clusters are very close to 
the profiles in the whole WormNet Fig. [T]C,F. It means that in our clusters the co-expression 
links prevail over other links. This observation provides a proof that genes of well-connected 
clusters are mainly connected to each other by co-expression. Also, our analysis demon- 
strates that expression links are responsible for the existence of the cycles in well-connected 
clusters Fig. [TF. 




Figure 1: The connectivity coefficient (A and D), SPF (B and E) and motif distribution (C and 
F) in the whole WormNet (up) and in its expression subpart (below). 
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B. Prediction of expression cluster regulators 

We have used the most connected cluster 1 to find an optimal algorithm to predict 
potential regulators of mRNA pool for the groups of co-expressed genes. Two different 
methods have been tested. 

The first method is based on ranking the network nodes according to a fraction of clus- 
ter genes (FCG) they are directly connected to. This method requires dense connectivity 
matrix to be efficient. The predicted functions are expected to have a strong and specific 
involvement in regulation of a particular co-expressed gene cluster. However, the ranked 
gene list needs filtering to eliminate a malfunctioning. 

The second method is based on ranking nodes according to the shortest average dis- 
tance to all cluster genes (SPF method). As in the first method, the obtained gene lists 
needs filtering. From the Fig. [T] one can see that the clusters have different topologies in 
different connectivity subnetworks, with the genetic interactions subnetwork being the least 
connected. The power of the SPF method is that it compensates the absence of knowledge 
about regulation of many genes and quantitatively validates the probability of co-regulation 
of a group of genes by different nodes. SPF method does not require a matrix to be dense 
and can be applied to a subnetwork of genetic interactions producing the list of regula- 
tors without filtering. FCG and SPF methods are applied to a total network and to the 
co-expression connectivity subnetwork with similar outcomes. The top of the predicted 
regulators is presented in Table [B 

Several predicted regulators are well integrated both in the top cluster networks and in the 
network reconstructed from the genes with a longevity phenotype retrieved from WormBase. 
Among the most promising predicted regulators that could connect the top clusters with 
a longevity regulation are: daf-2, iff-1, cgh-1, tin9.2, car-1. They all are related to mRNA 
processing/translation/decay and are in a cross-talking relationship (Table [T]). According 
to their position in the network, they may play a role of linkers between top connected 
co-expression clusters related to ribosomal biogenesis, proteosome and central metabolic 
functions (Fig 1). The SPF method has also been applied to the gene regulatory network. 
The results of this analysis (the top ranked regulatory nodes) are summarized in the Table 
Hn Note that the SPF method allows us to predict some regulators which we could not detect 
by the FCG method, and the power of this method had been enforced by its application to a 
gene regulatory connectivity subnetwork, as it is clearly demonstrated in Table [Tll We could 
identify a number of TFs that may be considered for a role of transcription regulators of genes 
in the top co-expression cluster 1, such as: F30F8.8, transcription initiation factor TFllD 
subunit 5; R74.3, xbp-1, heat-shock transcription factor; F02E9.4, sin-3-histone deacetylase 
subunit; 33A8.1, pre-mRNA-splicing factor CWC22. This method also greatly increased the 
ranking position of daf-2 and genes upstream daf-2 (C25A1.10) or being directly affected 
by daf-2 mutation (C05C8.3) an immediate potential connection to a group of genes with 
longevity phenotype that have a strong overlap with our cluster 1. 

Here we do not consider exclusion of the hub-regulators by the procedures (jnD for SPF 
method and ([H]) for FCG method. Our statistical analysis demonstrates that the most 
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Seq. IDs 


Gene 


Function 


F57B9.6 


inf-1 


Transl. initiation/ RNA transport 


T05G5.10 


iff-1 


Transl. initiation/ NMD 


Y71G12B.8 


Y71G12B.8 


RNA helicase/ RNA transport 


T10C6.14, T10C6.12, T10C6.il, F45F2.3, 
F45F2.4, F45F2.12, ZK131.4, ZK131.6, 
ZK131.8, ZK131.10, K06C4.10, K06C4.il, 
K06C4.4, K06C4.3, K06C4.12, ZK131.1, 
K06C4.2, F35H10.1, F17E9.12, F17E9.13, 

x^DUr ^. / , iVWO-rll.D, V_yOUr^.O, rUovjZ.Z, 

Rno'^'^ Q Rnri'^'^ 7 Fn7R7 q Fn7R7in 

Fn7R7 4 Fn7R7 Fn7R7 1 1 F^/1F,19 
F,^4F,12,^ F.^.^CI 11 F.^.^CI 10 F?2R3 1 
H02I12 7 T23D8 5 T23D8 6 


38 His genes 


Histones 


C41D11 2 

Vw^ X X-/ J L • ^ 


eif-3.H 


Tr^insl initiation 

X CXXJ.O J. • U J.C^ U J. v^XJ. 


F32E10 1 

X 1^ ^ J J X vy • X 


nol-10 


nil flpolriT Drotpi'n Dolvp'ln t, HiTirl in p" 


F54H12 6 

X (^^tXXX^tV./ 


eef-lB.l 


A JXv_y XX t-.CAJ U iV_y XX X CXjV-- U v^X 


C01F6 5 


alv-1 


RNA exnort 


Ml 63 3 

± vx X v./ • 


his-24 


XXXO Uv-/XXV_^0 


B0564.1 


tin-9.2 


decay/ NMD 


Y18D10A.17 


car-1 


decay / decapping 


F56D12.5 


vig-1 


RISC component/miRNA binding 


F26D10.3 


hsp-1 


splicing 


R04A9.4 


ife-2 


Transl. initiation 



Table I: Top of the predictable regulators for test cluster Nl by FCC method 



connected components of WormNet are included in the test cluster Nol, so the methods (JH]) 
and ([S]) give about the same list of potential regulators as FCG and SPF method respectively; 
the ranging order is slightly changed, for example, in a green frame in the figure we show gene 
(exc-7) that has been excluded in our SPF-hub-exclusion algorithm because its connectivity 
exceeds a fixed threshold. 

The Fig. [2] illustrates a typical position of the predicted potential regulators for the Cluster 
1. Nodes predicted by FCG method (purple frame are proximal to the cluster or even inside 
the cluster. The nodes predicted by SPF method may be significantly distant from the 
many nodes in the cluster (ces-1, eor-1, orange frames on Fig. [2]). Though the connections 
between the SPF-predicted node and the cluster may include several intermediate steps, 
the majority of these steps do contain the nodes that can translate signals at the level of 
mRNA pool regulation, potentially representing complexes of proteins with a joint regulatory 
performance. As one sees, the type of connectors utilized by different algorithms differs 
essentially: experimental, regulatory connections are fundamental for SPF and more dense, 
co-expression ones, for FCG. 
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Sea IDs 




in pi", ion 


Y55D5A 5 B0334 8 Y116F11B 1 


daf-2 a^el daf-28 


insnlin /ap'inp' 


F35H8 5 


6XC-7 


TTiRlVA ■nrorpssinff 


W10D5 1 


niGf-2 


TF 


C17D12 2 






C47G2 2 


nn f-l .SO 


TF 


F30F8 8 


TFIID 


Transl iTiitiatiori 


R74.3 


xbp-1 


TT^ ViistoTip morlnlatinTi 


F33A8 1 


cwc22 




C41C4.4 


xrG-1 


( IV A ■nrnrpssin p' 1 Hprav / ■nrorpssin p" 


C37H5 8 


lisp-6 


T^prav 


C26D10 2 


1ip1-1 ( hpli rasp 1 


DNA helicasp 


C07H6 5 


ppfb-l ( Hpranninp' ) 


Hppav / HpraDDinp' 


F02E9 4 


sin-3 fHDACl 


liistonp mnHnlation 


M163 3 


liis- 1 


TTistoTiP 


C25A1 10 


clcio-5 


vT^lVA transrriDt.ioTi /ap'inp' 


ZC247 3 


lin-11 


TF 


R107.8 


lin-12 


TF 


C05D9.5 


ife-4 


Transl. initiation 


R11E3.6 


eor-1 


TF 


F43G9.11 


ces-1 


TF 


ZK909.4 


ces-2 


TF 



Table II: Top of the predictable regulators for test cluster 1 by SPF method on genetic subnetwork 

IV. DISCUSSION 



Characterizing the degree of connectivity for a given gene to a specific set of genes in 
the network as a normalized sum of inverse distances in a network, is very natural and 
brings us back to the applications of "harmonic means" for graph analysis . The main 
conjecture behind the application of harmonic mean to networks is as follows; if a vertex 
has multiple links to other vertices, the information is sent "in parallel", i.e. concurrently 
along the network. Thus, one can define the "efficiency", ejj in communication between 



vertices i and j as the inverse of the shortest distance, i.e. Cij = 1/dij, see [22[. The 
average efficiency is straightforwardly related to the definition of SPF. It should be noted 
that the interpretation of the SPF function as the efficiency of communication could be very 
useful in further dynamic analysis of the networks. Actually, let v be the velocity, with 
which the information travels along the network, then the amount of information sent from 
the node i to the node j per unit time is just v/dij . The performance, P, is the total 



amount of information propagating over the network per unit time [22| . In our forthcoming 
works we plan to analyze the clusters taking into account their limited speed of information 
propagation. The concept of performance seems very appropriate for that. 
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Figure 2: Connectivity between the predicted regulators and Cluster 1 in STRING Network 
browser. Evidence view for high confidence (0.700) connections. Pink connectors - experimentally 
derived interactions, black-co-expression, and green-co-localization in genomes, blue-co-occurrences 
in genomes. Colored circles- input genes, white circles-the most associated additional nodes (set 
number of 200) automatically added by a STRING software on a request to increase a connectiv- 
ity between uploaded functions. Predicted potential regulators are shown in frames: orange-SPF 
method, purple-FCG method, green node excluded in hub-exclusion SPF method. 



Well-connected clusters of co-expressed genes described in this paper largely represent 
protein functional complexes, and they can be distinguished by the presence of a specific well- 
connected-6 link (unoriented) motif. This highly-connected motif can be used for detection 
of protein functional complexes (islands) in integral networks. These islands, in turn, serve 
in prediction of new regulatory nodes. In this study we used gene clusters derived from gene 
absolute expression values data that probably increase detection of true protein complexes 



expressed from indeed highly co-regulated genes 23 



Among the most interconnected clusters are the ones for ribosomal proteins and the 



On prediction of regulatory genes by analysis of C.elegans functional networks 11 



regulation of translation, proteosome, respiratory complex 1 and several central metabolic 
functions. Using the most interconnected cluster 1 we tried to detect potential regulators 
from the associated network context. Due to a non-directional nature of the edges in Worm- 
Base and an absence of directions in the co-expression network we were unable to distinguish 
between a cause, consequence or undirected physical interaction in a connected pair of pro- 
teins/functions and may only suggest the presence of the functional linkage between the 
expressed genes and the regulators. However, additional data from literature mining will 
likely help to vectorize the predicted interactions. 

The cluster, which we used for the method validation, contains a large number of genes 
involved in the translational machinery as well as several genes with a central metabolic role, 
among which we found a large number of genes associated with a longevity phenotype in 
WarmBase database (Table [T]). The protein translation processes indeed have been recently 
considered for a central role in the regulation of aging processes 2J, |25|] and we assumed 
that the regulators predicted in this study may also be linked to the processes underlying 
aging and the control of longevity. 

The role of regulation of translational machinery by the insulin pathway in aging has been 
widely discussed in literature |26|, |27| , however, the regulatory modules affecting the expres- 
sion of the related genes downstream of daf-2 have not been clearly defined. The candidates 
suggested by the FCG algorithm, iff-1 and bir-2 were shown to depend on daf-16-insulin 
response 28| and iff-1 has also been detected in gene expression screen for the longevity 
phenotype in C/elegans 25|. iff-1 is a eIF-5A homolog 29|], and eIF-5A that links processes 



of mRNA translation to the nonsense-mediated mRNA decay (NMD) [SOj. Activation of 
eIF-5A requires posttranslational modification of one of the protein's lysines into hypusine, 
and the enzyme that catalyses the first step of this modification, deoxyhypusine synthase. 



performs the NAD-dependent oxidative cleavage of spermidine 3l|- Spermidine is known 



to be involved in life span regulation and reproduction in a range of different organisms 



32|-l34j . though its mechanism of action is not clear. We suggest that its stimulatory role 



in NMD via regulation of eIF-5A may be of importance in regulating translation and as a 
consequence the life span of an organism. Interestingly, the ribosome maturation as well as 



the mRNA binding SBDS protein 35|, l36|] that is linked in a network to iff-1 and tin9.2, are 



both required for the longevity phenotype of daf-2 [37 



Our analysis points to a potential role of mRNA decay processes downstream of the 
insulin-dependent pathway in regulating translation and longevity. We suggest that elements 
of translational machinery, that are regulated via insulin/caloric restriction, may be indeed 
non-responsive to temperature changes as we see on the example of the Cluster 1 genes. Such 
persistence of expression may require specific mechanisms of adjustment to altered kinetics of 
biochemical reactions and may indeed involve a regulated mRNA decay process. Homeostasis 
of pathways regulated by nutrients supply regardless the temperature may lead to a very 
species-specific dynamic of cell survival and growth and an organism's life span adapted to 
the specific ecological dynamics of nutrients flow. The transcription factors predicted in our 
study by SPF method may also occur to be involved in aging and regulation of longevity . 



The genes cgh-1 |383, dao-5 [39|, hel-1 40| were already linked to aging processes downstream 
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daf-2, daf-16, and in case of dao-5 - to a daf-16 independent pathway associated with 
determination of the adult life span GO-term in WormNet database. Analysis of genotype- 



phenotype relationships j41[ when more data for the listed genes are available would allow 
deeper understanding of the direction of the defined links and more narrow prediction of 
their function. 

This work provides some new insights to the structure of biological functional networks 
and highlights the aspects that need to be considered in prediction of regulatory nodes, 
protein complexes and regulatory modules from a multilevel network context. We hope that 
it could be useful in application to analysis of other organism's networks and for improvement 
of analytic methods and software in the relevant fields. 
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